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"Est enim fere ex pulcherrimis qu<£ solvere desiderem. " 
(It is among the most beautiful I could desire to solve.) [Newton 1676] 



Abstract. Starting with a novel definition of divided differences, this 
essay derives and discusses the basic properties of, and facts about, (uni- 
variate) divided differences. 
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1 Introduction and basic facts 

While there are several ways to think of divided differences, including the one 
suggested by their very name, the most efficient way is as the coefficients in a 
Newton form. This form provides an efficient representation of Hermite inter- 
polants. 

Let II C (IF — > IF) be the linear space of polynomials in one real (IF = H) or 
complex (IF = C) variable, and let il<„ denote the subspace of all polynomials 
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of degree < n. The Newton form of p S II with respect to the sequence t = 
(t\,t2, ■ ■ •) of centers tj is its expansion 

oo 

P= : J2 W 3-l,tc(j) (1) 

i=i 

in terms of the Newton polynomials 

Wi := w it t :=(•-*!)•■■(• -ti), i = 0,l,.... (2) 

Each does, indeed, have exactly one such expansion for any given t since 

degWjj — j, all j, hence (ttij-ij ■ j 6 IN) is a graded basis for II in the sense 
that, for each n, (wj-ij '■ j — 1-ri) is a basis for n<„. 
In other words, the column map 

oo 

Wf.W™ ^n:c^^ Wj _ lit c(j) (3) 
j=i 

(from the space TF^ of scalar sequences with finitely many nonzero entries to 
the space II) is 1-1 and onto, hence invertible. In particular, for each n £ IN, 
the coefficient c(n) in the Newton form (1) for p depends linearly on p, i.e., 
p i ► c(n) = (W t _ p)(n) is a well-defined linear functional on II, and vanishes 
on n<„_i. More than that, since all the (finitely many nontrivial) terms in (1) 
with j > n have w n> t as a factor, we can write 

p = p n + w n , t q n , (4) 
with q n a polynomial we will look at later (in Example 6), and with 

n 

Pn ■= ^2wj- llt c(j) 
3=1 

a polynomial of degree < n. This makes p n necessarily the remainder left by 
the division of p by w n j, hence well-defined for every n, hence, by induction, 
we obtain another proof that the Newton form (1) itself is well-defined. 
In particular, p n depends only on p and on 

f-l-.n • — (^1 5 • • • > tn)t 

therefore the same is true of its leading coefficient, c(n). This is reflected in the 
(implicit) definition 



oo 

p =: ^2wj- ht A(t 1:j )p, pen, 



(5) 



48 



C. de Boor 



in which the coefficient c(j) in the Newton form (1) for p is denoted 

A(t 1:j )p = A(t u ...,t 3 )p := ((m)"V)(i) (6) 

and called the divided difference of p at ti, . . . , tj. It is also called a divided 
difference of order j — 1, and the reason for all this terminology will be made 
clear in a moment. 

Since W t is a continuous function of t, so is , hence so is A(ti : j) (see 
Proposition 21 for proof details). Further, since Wjj is symmetric in t\,...,tj, 
so is A(t\ : j). Also, A(ti : j) _L n <:) (as mentioned before). 

In more practical terms, we have 

Proposition 7. The sum 

n 

Pn = ^2' r j l..< ;•/> 

J=l 

of the first n terms in the Newton form (1) for p is the Hermite interpolant to p 
at t\- n , i.e., the unique polynomial r of degree < n that agrees with p at i 1:n in 
the sense that 

D l r(z) = D l p(z), < i < Mz := #{j S [1 . . n] : tj = z}, z 6 F. (8) 

Proof: One readily verifies by induction on the nonnegative integer /i that, for 
any z € F, any polynomial / vanishes /i-fold at z iff / has (• — zY as a factor, 
i.e., 

D7(z)=0 for i = 0,...,p-l <^=> /e(--z)^n. (9) 

Since p — p n = w n ,tq n , this implies that r — p n does, indeed satisfy (8). 

Also, p„ is the only such polynomial since, by (9), for any polynomial r 
satisfying (8), the difference p n — r must have w n as a factor and, if r is of 
degree < n, then this is possible only when r = p n . □ 



Example 1. For n = 1, we get that 

A(ti) :pnp(ii), 

i.e., A(t) can serve as a (nonstandard) notation for the linear functional of 
evaluation at r. □ 
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Example 2. For n = 2, p n is the polynomial of degree < 2 that matches 
p at ti-2- If ti ^ t 2 , then we know p2 to be writable in 'point-slope form' as 

P2 = P(tl) + (■ - tl) , 

12 — tl 

while if t\ — t 2 , then we know p 2 to be 

P2=p(h) + (--t 1 )Dp(t 1 ). 

Hence, altogether, 

r P (t 2 ) - P (h) , 

A(ti :2 )p = ^ *2-*i ' ^ 2 ' (10) 
[ Dp(ti), otherwise. 

Thus, for ti ^ t2, ^(ii:2) is a quotient of differences, i.e., a divided difference. □ 

Example 3. Directly from the definition of the divided difference, 

A(t 1:j )wi- l!t = Sji, (11) 
therefore (remembering that A(ti : j) _L n <J _i) 

MtujW 1 - 1, (12) 

with 

a handy if nonstandard notation for the power functions. □ 

Example 4. If t is a constant sequence, t = (r, r, . . .) say, then 
Wj,(r,T,...) = (' - T )^ 

hence the Taylor expansion 

oo 

p = Y,{--r) n D n p{r)/n\ (13) 

71 = 

is the Newton form for the polynomial p with respect to the sequence (r, r, . . .). 
Therefore, 

A(r[™ +1 ])p:= A(w> = fl"p(T)/»!, n = 0, 1, . . . . (14) 

n+1 terms 

□ 
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Example 5. If £ : t >— ► at + b, then (£(z) — t(ti)) = a(z — ti), hence 

a"- 1 A(e(t 1:n ))p = A(t 1:n )(po£). (15) 

□ 

Example 6. Consider the polynomial q n introduced in (4): 

p = p n + w ntt q n . 

Since p(t n+1 ) = p n+ i{t n+1 ) and p n+1 = p n + w„ ;t A(t 1:n+ i)p, we have 
w ntt (t n+ i)q n (t n+ i) = w ntt (t n+ i)A(t 1:n+1 )p, 

therefore 

q n (t n +i) = A(t V . n+ i)p, 

at least for any i„ +1 for which w n ^ t (t n+ i) ^ 0, hence for every £„ +1 £ F, by 
the continuity of q n , and the continuity of A(ti :n ,-)p, i.e., of A(ti :n+ i)p as a 
function of t n+ \. It follows that 

q n = A(ii :n , -)p 

and 

p = p n + w ntt A(ti:n,-)P, (16) 

the standard error formula for Hermite interpolation. More than that, by the very 
definition, (4), of q n , we now know that 

A(t 1:n ,-)p = q n = (p - p n )/w n , t = Wj ~ 1,t A(t 1:j )p, (17) 

and we recognize the sum here as a Newton form with respect to the sequence 
(tj : j > n). This provides us with the following basic divided difference identity: 

A(i n+ i :j -)A(*i :n , •) = A(t 1:j ), j > n. (18) 

□ 

For the special case n = j — 2, the basic divided difference identity, (18), 
reads 

A(^_ 1:j )A(t 1:j _ 2 ,-) = A(t 1:j ), 
or, perhaps more suggestively, 

A(tj_l,-)A(tl :J -_2,-) = A(tl:j_l,-), 

hence, by induction, 

Afe-i.OAfe-j,-) •■■A(ti,-) = A(ti : j_i, •). (19) 

In other words, A(ti : j) is obtainable by forming difference quotients j — 1 times. 
This explains our calling A(ti : j) a 'divided difference of order j — V . 



Divided Differences 51 

2 Continuity and smoothness 

The column map 

oo 

W t :F^n:c^]T Wj _ MC (j) 
j'=i 

introduced in (3) is continuous as a function of t, hence so is its inverse, as 
follows directly from the identity 

A- 1 -B- 1 =A~ 1 (B-A)B- 1 , (20) 

valid for any two invertible maps A, B (with the same domain and target). 
Therefore, also each A(ti : j) is a continuous function of t, all of this in the 
pointwise sense. Here is the formal statement and its proof. 

Proposition 21. For any p G II, 

lim(A(s 1: ,)p :j6K) = (A(t 1:j )p : j £ N). 

S— >t 

Proof: Let p E II <n . Then t i— > A(ti ; &)p = for k > n, hence trivially 
continuous. As for A; < n, let 

n 

W t ,„ := F -» n<„ : c » ^ ^- 1;t c(j) 

be the restriction of Wt to IF™, as a linear map to II< n . Then, in whatever 
norms we might choose on IF™ and Il< n , Wt, n is bounded and invertible, hence 
boundedly invertible uniformly in ti- n as long as ti :n lies in some bounded set. 
Therefore, with (20), since lim s ^ t W s , n — W t: n, also 

lim(A(si :j )p : j = l:n) = {Wt^^p = {A(t 1:j )p : j = l:n). 

□ 

This continuity is very useful. For example, it implies that it is usually 
sufficient to check a proposed divided difference identity by checking it only for 
pairwisc distinct arguments. 

As another example, we used the continuity earlier (in Example 6) to prove 
that A(ti: n , -)p is a polynomial. This implies that A(ii :n , -)p is differcntiablc, 
and, with that, (18) and (14) even provide the following formula for the deriva- 
tives. 

Proposition 22. 

D k A(t 1:j , > = fc!A(ii :j -, l-] k+1 ) P , k e N. 
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3 Refinement 

Already Cauchy [Cauchy 1840] had occasion to use the simplest nontrivial case 
of the following fact. 

Proposition 23. For any n-sequence t and any 1 < er(l) < • • • < a(k) < n, 

<r(k)-k 
J'=«t(1)-1 

with a — at.a positive in case t is strictly increasing. 
Proof: Since A(ti :n ) is symmetric in the tj, (18) implies 

(t n - ii)(A(i 1: „\ m ) - A(i 2: „)) = {ti - t rn ){A(t 2 :„) - A(tl :n _l)), 

with 

^l:n\m • ^l:m— l,m+l:n • (^1? ■ • ■ 7 ^m— 15 • ■ • ; ^n)- 

On rearranging the terms, we get 

(i n -ti)A(ti : „\ m ) = (t n - t m )A(t 2:n ) + (i m - tl)A(ti : „_i), 

and this proves the assertion for the special case k = n — 1, and even gives an 
explicit formula for a in this case. 

From this, the general case follows by induction on n— k, with a computable 
as a convolution of sequences which, by induction, are positive in case t is strictly 
increasing (since this is then trivially so for k = n — 1), hence then a itself is 
positive. □ 

My earliest reference for the general case is [Popoviciu 1933]. 

4 Divided difference of a product: Leibniz, Opitz 

The map 

P := P n>t : n -» II : p i-> p„, 
of Hermite interpolation at ti :n , is the linear projector P on II with 

ranP = n<„, ran(id — P) = nullP = w t , n H- 

In particular, the nullspace of P is an ideal if, as we may, we think of II as a 
ring, namely the ring with multiplication defined pointwise, 



(pg)(z) :=p(z)g(z), zeJF. 
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In other words, the nullspacc of P is a linear subspace closed also under point- 
wise multiplication. This latter fact is (see [de Boor 2003b]) equivalent to the 
identity 

P(pq) = P(p(Pq)), p,qen. (24) 
For p G II, consider the map 

m p : n<„ - n<„ : / i ► P(pf). 

Then M p is evidently linear and, also evidently, so is the resulting map 

M : IT — > L(II <n ) :p^M p 

on II to the space of linear maps on IT< n . More than that, since, by (24), 

M„f = P( P qf) = P{pP(q.f)) = M p M q f, p e n<„, P ,qen, 

M is a ring homomorphism, from the ring II into the ring L(Tl <n ) in which 
composition serves as multiplication. The latter ring is well known not to be 
commutative while, evidently, ranM is a commutative subring. 
It follows, in particular, that 

M p =p(M {)1 ), peU, 

hence 

m p =p(Mqi), pen, 

for the matrix representation 

M p := VMpV' 1 

of M p with respect to any particular basis V of n<„. Look, in particular, at 
the matrix representation with respect to the Newton basis 

V := : j = l:n] 

for n<„. Since 

O^j-i.t = tjWj-^t + Wj,t, 3 = 1, 2, • • • , 
therefore evidently 

MQiWj- ltt = Pitfwj-xj) = tjWj- lit + (1 - Sj,n)wj,t, j = l:n. 
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Consequently, the matrix representation for Mqi with respect to the Newton 
basis V is the bidiagonal matrix 



Mqi = A n>t : = 



On the other hand, for any p G IT and j = l:n, 

\ 

y^{wi-i )t /wj- 1<t )A(tj : i)p Wj-i,t 

i=j J 

is a polynomial of degree < n and, for pairwise distinct U, it agrees with pwj-ij 
at t\-, n since the sum describes the polynomial of degree < n — j that matches p 
at tj-n while both functions vanish at ii^-i- Consequently, with the convenient 
agreement that 

A(tj..i) : 0, j > i, 

we conclude that 

n 

P(pwj-i,t) = ^2w^i^ t A(t r .i)p, j = l:n, 

i=l 

at least when the U are pairwise distinct. In other words, the jth column of the 
matrix M p = V~ 1 M p V (which represents M p with respect to the Newton basis 
V for II <n ) has the entries 

(A(t j:i )p : i = l:n) = (0, . . . , 0,p(tj), A(tj, t j+ i)p, . . . , A(t j:n )p). 

By the continuity of the divided difference (see Proposition 21), this implies 
Proposition 25: Opitz formula. For any p 6 II, 

p(A n , t ) = {A(t j . A )p:i,j = l:n). (26) 

The remarkable identity (26) is due to G. Opitz; see [Opitz 1964] which 
records a talk announced but not delivered. Opitz calls the matrices p(A n ^) 
Steigungsmatrizen ('difference-quotient matrices'). Surprisingly, Opitz explic- 
itly excludes the possibility that some of the tj might coincide. [Bulirsch ct al. 
1968] ascribe (26) to Sylvester, but I have been unable to locate anything like 
this formula in Sylvester's collected works. 



1 h 

i h 
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Example 7. For the monomial () fe , Opitz' formula gives 
A(ii : „)() fc = {A n>t ) k {n,l) = KMi v k)A n , t {vk,Vk-i) ■ ■ ■ A n , t {u u l), 

ve{l:n} k 

and, since A n ^ is bidiagonal, the vtYi summand is zero unless the sequence 
(1, v\, . . . , Vh, n) is increasing, with any strict increase no bigger than 1, in which 
case the summand equals t a , with ctj — 1 the multiplicity with which j appears 
in the sequence v, j = \:n. This confirms that A(ti :n )() k — for k < n— 1 and 
proves that 

Mtv.n)() k = ]T t a , k>n-l. (27) 

\a\— k — n— 1 

□ 

My first reference for (27) is [Steffenscn 1927: p.l9f]. To be sure, once 
(27) is known, it is easily verified by induction, using the Leibniz formula, to 
be derived next. 

Since, for any square matrix A and any polynomials p and q, 

(pq)(A)=p(A)q(A), 

it follows, in particular, that 

Mh:n)(pq) = M^ q {n, 1) - M p {n, :)M q {:, 1), 

hence 

Corollary 28: Leibniz formula. For anyp,q £ LT, 

A(ti :n )(pg)= ]T /K(t r . n ) P A(t 1:3 )q. (29) 

j=l:n 

On the other hand, the Leibniz formula implies that, for any p, q e LT, 

{A(t j:i )p : i,j = l:n){A(t j:i )q : i,j = l:n) = (A(t j: i)(pq) : i,j = l:n), 

hence, that, for any p G IT, 

p((A(t j:i )() 1 : i,j = l:n)) = {A(t j:i )p : i,j = l:n). 

In other words, we can also view Opitz' formula as a corollary to Leibniz' 
formula. 

My first reference for the Leibniz formula is [Popoviciu 1933], though Stef- 
fensen later devotes an entire paper, [Steffenscn 1939], to it and this has become 
the standard reference for it despite the fact that Popoviciu, in response, wrote 
his own overview of divided differences, [Popoviciu 1940], trying, in vain, to 
correct the record. 

The (obvious) name 'Leibniz formula' for it appears first in [de Boor 1972]. 
Induction on m proves the following 

Corollary 30: General Leibniz formula. For f : W m — *■ F, 

A(t u ...,t k )f (■,...,■)= ]T (®]LiMtiu-i),---,t i{j) ))f. 

l=i(l)<...<i(m)=fe 
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5 Construction of Newton form via a divided difference table 

Divided difference table. Assume that the sequence (ti, . . . , t n ) has all its 
multiplicities (if any) clustered, meaning that, for any i < j, ti — tj implies that 
U = t l+ i = ■■■ =tj. Then, by (18) and (14), 

( A(t i+ l:j)p- A(t i; j-l)p t .U t .. 

MU-.j)p = < *j — *i * 1 < i < j < n. 

[ Di~ l p(ti)/(j — i)\ otherwise, 

Hence, it is possible to fill in all the entries in the divided difference table 

A(h)p 

A(t 1:2 )p 

A(t 2 )p A(t 1:3 )p 
A{t 2 : 3 )p 

A(t 3 ) P 

A(t„_ 3:n _i)p 

A(i n _ 2: „-l)p 
A(t n -l)p A(t n -2:n)p 
A(t n -l-.n)p 

A(t n )p 

column by column from left to right, using one of the n pieces of information 

y(j):=D^p(t 3 ), fij := #{i < j : U = tj}; j = l,...,n, (31) 

in the leftmost column or else whenever we would otherwise be confronted with 
0/0. 

After construction of this divided difference table, the top diagonal of the 
table provides the coefficients (A(ti-.j)p : j — l,...,n) for the Newton form 
(with respect to centers t\, . . . ,t n -\) of the polynomial of degree < n that 
matches p at ti :n , i.e., the polynomial p n . More than that, for any sequence 
(ii, . . . , i n ) in which, for each j, . . . ,ij} consists of consecutive integers in 
[1. .n], the above divided difference table provides the coefficients in the Newton 
form for the above r, but with respect to the centers (t^. : j — l:n). 

Now note that the only information about p entering this calculation is the 
scalar sequence y described in (31). Hence we now know the following. 

Proposition 32. Let (ti, . . . , t n ) have all its multiplicities (if any) clustered, 
and let y € F* be arbitrary. For j = 1, . . . , n, let c(j) be the first entry in the 
jth column in the above divided difference table as constructed in the described 
manner from y. 



A(ti :n _i)p 

A(t 1:n )p 

A(t 2 ;n)P 
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Then 

n 

<■ ■ X! W 3-hAj) 
3=1 

is the unique polynomial of degree < n that satisfies the Hermite interpolation 
conditions 

D"'r(*i) = y(j), fij := #{i < j : U = tj}; j = l,...,n. (33) 

6 Evaluation of a Newton form via Horner's method 
Horner's method. Let c(j) :— A(ti-.j)r for j = 1, . . . , n > degr, z £ JF, and 
c(n) := c(n), 

c(i) := c(j) + (z- tj)c(j + 1), j = n-1, n-2, . . . , 1. 
Then c(l) = r(z). More than that, 

n 

r = ^w._ lT c(i), 

3=1 

with 

t := (z,ti,h, ■ ■ •)• 

Proof: The first claim follows from the second, or else directly from the fact 
that Horner's method is nothing but the evaluation, from the inside out, of the 
nested expression 

r(z) = c(l) + {z- ti)(c(2) + ■■■ + («- t n _ 2 )(c(n - 1) + (« - t„_i)c(n)) • • ■), 

for which reason Horner's method is also known as Nested Multiplication. 

As to the second claim, note that A(z, ii :n _i)r = A(ti :n )r since degr < n, 
hence c(n) — A(z, t\- n -\)r, while, directly from (18), 

A(-,t 1:j -_i) = A(t 1:j ) + (--tj)A(;t 1:j ), jeW, (34) 

hence, by (downward) induction, 

c(j) = A{z,ti-.j- 1 )r, j = n-1, n-2, . . . , 1. 

□ 
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In effect, Horner's Method is another way of filling in a divided difference 
table, starting not at the left-most column but with a diagonal, and generating 
new entries, not from left to right, but from right to left: 



A{z)r 

A(ti)r A(z,t 1:2 )r 
A(t 1:2 )r 



A(t 1:3 )r A(z,t 1:n - 2 )r 

M*,h:n-i)r = A(ti :n )r 

Hh:n-l)r 

A(t 1:n )r 



Hence, Horner's method is useful for carrying out a change of basis, going from 
one Newton form to another. Specifically, n — 1-fold iteration of this process, 
with z = z n -i, ■ ■ ■ , zi, is an efficient way of computing the coefficients (A(zi-.j)r : 
j = 1, . . . , n), of the Newton form for r E n<„ with respect to the centers zi :n -i, 
from those for the Newton form with respect to centers ti :n _i. Not all the steps 
need actually be carried out in case all the Zj are the same, i.e., when switching 
to the Taylor form (or local power form). 



7 Divided differences of functions other than polynomials 

Proposition 35. On II, the divided differences A(t\ : j), j = 1, . . . ,n, provide 
a basis for the linear space of linear functionals spanned by 

A.: /,,/.>" , //, := #^ <j:U = tj}; j = l,...,n. (36) 

Proof: By Proposition 7 and its proof, 

n™ =1 ker A(t 1:j ) = w n>t Il - nj =1 kcr A{tj)D^ . 

Another proof is provided by Horner's method, which, in effect, expresses 
(A(ti : j) : j = l:n) as linear functions of (A(tj)D^ : j — l:n), thus showing 
the first sequence to be contained in the span of the second. Since the first is 
linearly independent (as it has (wj—ij ■ j = l'n) as a dual sequence) while both 
contain the same number of terms, it follows that both are bases of the same 
linear space. □ 
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This proposition provides a ready extension of A(ti :n ) to functions more 
general than polynomials, namely to any function for which the derivatives men- 
tioned in (36) make sense. It is exactly those functions for which the Hermite 
conditions (33) make sense, hence for which the Hermite interpolant r of (32) 
is defined. This leads us to G. Kowalewski's definition. 

Definition 37 ([G. Kowalewski 1932]). For any smooth enough function 
f defined, at least, at t\,...,t n , A(ti : „)/ is the leading coefficient, i.e., the 
coefficient of ()" _1 , in the power form for the Hermite interpolant to f at t\- n . 

In consequence, A(ii : „)/ — A(ti :n )p for any polynomial p that matches f 



Example 8. Assume that none of the tj is zero. Then, 

A^Xr 1 = (-l)"-7(*i •••*«)• (38) 

This certainly holds for n = 1 while, for n > 1, by (29), = A(fi :n )(() _1 () 1 ) = 
A(ii :n )()- 1 i„ + A^i^.^Q" 1 , hence A(ti : „)()" 1 = - A^-iX)" 1 /^, and 
induction finishes the proof. This implies the handy formula 

A(t 1:n )(z- -y 1 = l/w n , t (z), z^h,...,t n . (39) 

Therefore, with := #{j : £ = tj, 1 < j < n} the multiplicity with which £ 
occurs in the sequence t\- n , and 



the partial fraction expansion of l/w ntt , we obtain Chakalov's expansion 

A(to,...,t fc )/ = 2 E ^"/(0 ( 4 °) 
«et o< M <#e 

(from [Chakalov 1938]) for / := l/(z — •) for arbitrary z since D fM l/(z — •) = 
fi\/(z— •) Ai+1 , hence also for any smooth enough /, by the density of {l/(z— •) : 
z G F}. 

This is an illustration of the peculiar effectiveness of the formula (39), for 
the divided difference of l/(z — •), for deriving and verifying divided difference 
identities. □ 
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Example 9. When the tj are pairwise distinct, (40) must reduce to 

n 

Mh:n)f = J2f( t i)/ Dw n,t(tj), (41) 

since this is readily seen to be the leading coefficient of the polynomial of degree 
< n that matches a given / at the n pairwise distinct sites t\, . . . , t n when we 
write that polynomial in Lagrange form, 

t-m) n j~T' 

3 = 1 iel:n\j 3 

It follows (see the proof of [Erdos et al. 1940: Lemma I]) that, for — 1 < 
ti < ■ ■ ■ < t n < 1, 

n 

|| A(ii :n ) : C{{-\ . . 1]) -+ F|| = l/\Dw n , t (tj)\ > 2"- 2 , (42) 

with equality iff w n _ t = (() 2 — 1){/„_2, where C/„_2 is the second-kind Chcbyshcv 
polynomial. 

Indeed, for any such 

t := (ti, . . . ,i„), 

the restriction A of A(r) to n<„ is the unique linear functional on n<„ that 
vanishes on n <n _i and takes the value 1 at () n_1 , hence takes its norm on the 
error of the best (uniform) approximation to () ,l_1 from n<„_i, i.e., on the 
Chebyshev polynomial of degree n — 1. Each such A(r) is an extension of this 
A, hence has norm > ||A|| = 1/ dist (Q"" 1 , n<„_i) = 2"- 2 , with equality iff 
A(r) takes on its norm on that Chebyshev polynomial, i.e., iff r is the sequence 
of extreme sites of that Chebyshev polynomial. □ 

8 The divided difference as approximate normalized derivative 

Assume that / is differentiate on an interval that contains the nondecreasing 
finite sequence 

T = (r < • • • < Tfc), 

and assume further that A(t)/ is defined, hence so is the Hermite interpolant 

Prf 

of / at t. 

Then / — P T f vanishes at T 0: fe, therefore D(f — P T f) vanishes at some 
(j = (do, • • • j ffe-i) that interlaces r, meaning that 

n < (Ti < Tj+i, all i. 

This is evident when r, = Tj for some i < j, and is Rolle's Theorem when 
n < Tj+i. Consequently, DP T f is a polynomial of degree < k that matches Df 
at <7o, • • • , crfe_i, hence must be its Hermite interpolant at a. This proves the 
following. 
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Proposition 43 ([Hopf 1926]). If f is diffcrcntiable on an interval that con- 
tains the nondecreasing (k + l)-sequence r and smooth enough at r so that its 
Hermite interpolant, P T f, at t exists, then there is a nondecreasing k -sequence 
a interlacing t and so that 

PADf)=DP T .f. 

In particular, then 

kA(r)f = A(a)Df. 

From this, induction provides the 

Corollary ([Schwarz 1881-2]). Under the same assumptions, but with f k 
times differentiablc on that interval, there exists £ in that interval for which 

MA(T ,...,T k ) = D k f(0- (44) 

The special case k = 1, i.e., 

A(a, b)f = Df{£), for some £ e (a . . b), 

is so obvious a consequence or restatement of L'Hopital's Rule, it must have 
been around at least that long. 

Chakalov [Tchakaloff 1934] has made a detailed study of the possible values 
that £ might take in (44) as / varies over a given class of functions. 

[A. Kowalewski 1917: p. 91] reports that already Taylor, in [Taylor 1715], 
derived his eponymous expansion (13) as the limit of Newton's formula, albeit 
for equally spaced sites only. 

9 Representations 

Determinant ratio. Let 

r := (r , . . . ,T fe ). 

Kowalewski's definition of A(r)/ as the leading coefficient, in the power form, 
of the Hermite interpolant to / at r gives, for the case of simple sites and via 
Cramer's Rule, the formula 

A(r)f = detQ T [()",..., (J*" 1 , /]/det Q T [()°, . . . , ()*] (45) 

in which 

Qr[go, ■ ■ -,9k] ■= ijajin) : i,j = Q,...,k). 

In some papers and books, the identity (45) serves as the definition of A(t)J 
despite the fact that it needs awkward modification in the case of repeated sites. 
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Peano kernel (B-spline). Assume that r := (to, ...,Tfc) lies in the 
interval [a . . b] and that / has k derivatives on that interval. Then, on that 
interval, we have Taylor's identity 

f(x) = ^> - a) j D j f(a)/j\ + [\x - y^D^y) dy/(k - 1)!. (46) 

3<k Ja 

If now t < Tfe, then, from Proposition 35, A(t) is a weighted sum of values of 
derivatives of order < k, hence commutes with the integral in Taylor's formula 
(46) while, in any case, it annihilates any polynomial of degree < k. Therefore 

A(r)f = [ M(-\r)D k f/k\, (47) 

with 

M{x\t) := kA(r)(- - xf' 1 (48) 

the Curry-Schoenberg B-spline (see [Curry & Schoenberg 1966]) with knots r 
and normalized to have integral 1. While Schoenberg and Curry named and 
studied the B-spline only in the 1940's, it appears in this role as the Peano 
kernel for the divided difference already earlier, e.g., in [Popoviciu 1933] and 
[Tchakaloff 1934] (sec [dc Boor et al. 2003]) or [Favard 1940]. 

Contour integral. An entirely different approach to divided differences 
and Hermite interpolation begins with Frobenius' paper [Frobenius 1871], so 
different that it had no influence on the literature on interpolation (except for a 
footnote-like mention in [Chakalov 1938]). To be sure, Frobenius himself seems 
to have thought of it more as an exercise in expansions, never mentioning the 
word 'interpolation'. Nevertheless, Frobenius describes in full detail the salient 
facts of polynomial interpolation in the complex case, with the aid of the Cauchy 
integral. 

In [Frobenius 1871], Frobenius investigates Newton series, i.e., infinite ex- 
pansions 

oo 

i=i 

in the Newton polynomials Wjj defined in (2). He begins with the identity 
a ready consequence of the observations 

XWj-i tt (x) = W jtt (x) + tjWj-i tt (x), 

V _ 1 tj 
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since these imply that 

W 3-lA X ) = Wj, t (x) | tjWj- lit {x) 

j^i WjAv) y 1 w jAv) ^ w iAv) 

Then (in §4), he uses (49), in the form 



sr- w 3-i,t(z) WnAz) = M(r-?\ 
U wjAO + (C-z)w n AC) w h 



in Cauchy's formula 

f(z\ = 

to conclude that 



/w-f-^ + ^a/ g-^o - (50) 

with 

1 / /(C) dC 
Cj -=7^9 ^777' j = l,...,n. 

For this, he assumes that z is in some disk of radius p, in which / is entire, and 
( runs on the boundary of a disk of radius p' < p that contains z, with none of 
the relevant tj in the annulus formed by the two disks. 

Directly from the definition of the divided difference, we therefore conclude 
that, under these assumptions on / and t, 

-55/^ i = 0,1,2 <51) 

Strikingly, Frobenius never mentions that (50) provides a general polynomial 
interpolant and its error. Could he have been unaware of it? To be sure, he 
could not have called it 'Hermite interpolation' since Hermite's paper [Hermite 
1878] appeared well after Frobenius'. There is no indication that Hermite was 
aware of Frobenius' paper. 

Genocchi-Hermite. Starting with (19) and the observation that 
A(x,y)f= f Df((l-s)x + sy)ds, 



G4 



C. de Boor 



induction on n gives the (univariate) Genocchi-Hermite formula 

A(r ,...,r n )/= f D n f, (52) 

J[ro,-,r n ] 

with 




/((l - si)t H h (s„_i - s„)t„_i + s„t„) ds„ • • • dsi. 



[Norlund 1924: p. 16] mistakenly attributes (52) to [Hcrmite 1859], possibly 
because that paper carries the suggestive title "Sur Finterpolation" . 

At the end of the paper [Hermite 1878] , on polynomial interpolation to data 
at the n pairwise distinct sites t\, . . . ,t n in the complex plane, Hermite does 
give a formula involving the righthand-side of the above, namely the formula 

f{x) - Pf{x) = (x-h)---(x- t n ) [ D n f 

J[t n , — ,ti,x] 

for the error in the Lagrange interpolant Pf to / at t\- n . Thus, it requires the 
observation that 

f{x) - Pf(x) = {x-t l )---{x- t n )A{t n , h,x)f 

to deduce the Genocchi-Hermite formula from [Hermite 1878]. (He also gives 
the rather more complicated formula 

f(x) - Pf(x) = 

(x - a x ) a ■ ■ ■ (x - a n ) x [ \s n - sn-i]" - 1 ■■■[!- s 1 j X - 1 D a +-+\f 

J[a n ,...,ai,x] 

for the error in case of repeated interpolation. Here, \zf := z J 

In contrast, [Genocchi 1869] is explicitly concerned with a representation 
formula for the divided difference. However, the 'divided difference' he repre- 
sents is the following: 

A(x, x + hi)A{-, ■ + h 2 ) ■ ■ ■ A(-, • + h n ) = (Ahjhi) ■ ■ ■ (A hn /h n ) 

and for it he gets the representation 




D n f(x + Mi + • • • + Mn) dti ■ ■ ■ dt 
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[Norlund 1924: p.16] cites [Genocchi 1878a], [Genocchi 1878b] as places 
where formulations equivalent to the Genocchi-Hermite formula can be found. 
So far, I've been only able to find [Genocchi 1878b]. It is a letter to Hermite, in 
which Genocchi brings, among other things, the above representation formula 
to Hermite's attention, refers to a paper of his in [Archives de Grunert, t. XLIX, 
3e cahier] as containing a corresponding error formula for Newton interpolation. 
He states that he, in continuing work, had obtained such a representation also 
for Ampere's fonctions interpolatoires (aka divided differences), and finishes 
with the formula 



■ s n -i 



Jo Jo 

D n f(x + si(xi — xo) H h sis 2 • • • s n (x n - x n -i)) dsi ■ ■ ■ ds n 

for A(xo, . . . , x n )f , and says that it is equivalent to the formula 

A(x , . . . , x n )f = j - J D n f(s x + sixi H s n x n ) ds x ■ ■ ■ ds n 

in which the conditions sq + • • • + s n = 1, Si > 0, all z, arc imposed. 

[Steffenscn 1927: p.l7f] proves the Genocchi-Hermite formula but calls it 
Jensen's formula, because of [Jensen 1894]. 

10 Divided difference expansions of the divided difference 

By applying A(si :m ) to both sides of the identity 

n 

obtained from (16), one obtains the expansion 

n 

A(S1 :TO ) = M S l--m)Wj-l,tA(tl:j)+E(s,t), 
j=m 

where, by the Leibniz formula (29), 

E(s,t) := A(s 1:m )(w n , t A(t 1:n ,-)) 

m 

= X A(s l . rn )w njt A(s 1:m ,t 1 .. n ). 

i=l 

But, following [Floater 2003] and with p := n — to, one gets the better 
formula 

m 

E ( s , t) ■= X!( Sl _ ti+ P ){A(s 1: i)wi +p , t )A(t 1: i+p, s i:m ) 

i=l 
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in which all the divided differences on the right side are of the same order, n. 
The proof (see [de Boor 2003a]), by induction on n, uses the easy consequence 
of (29) that 

(si - y)A(s i:m )f = A(s i:m )((- - y)f) - A(s i+1:m )f. 
The induction is anchored atn = m for which the formula 

m 

A(s 1:m ) - A(*i :m ) = ^(Si - ti)A(s 1:i , ti: m ) 
i=l 

can already be found in [Hopf 1926]. 

11 Notation and nomenclature 

It is quite common in earlier literature to use the notation 

[yi, •••,%•] 

for the divided difference of order j — 1 of data ((tj, : i = This reflects 
the fact that divided differences were thought of as convenient expressions in 
terms of the given data rather than as linear functionals on some vector space 
of functions. 

The presently most common notation for A(tuj)p — A(ti, . . . , tj)p is 

P[h, ■ ■ -,tj] 

(or, perhaps, p(ti, . . . , tj)) which enlarges upon the fact that A(z)p = p(z), but 
this becomes awkward when the divided difference is to be treated as a linear 
functional. In that regard, the notation 

[h,. . .,tj]p 

is better, but suffers from the fact that the resulting notation 

[ii,.. .,tj] 

for the linear functional itself conflicts with standard notations, such as the 
matrix (or, more generally, the column map) with columns ti, . . . , tj, or, in the 
special case j = 2, i.e., 

[tiM 

the closed interval with endpoints t\ and t2 or else the scalar product of the 
vectors t\ and t2- The notation 



[t x , . . .,tj;p] 
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does not suffer from this defect, as it leads to the notation [ti, . . . , tf, •] for the 
linear functional itself, though it requires the reader not to mistakenly read that 
semicolon as yet another comma. 

The notation, A, used in this essay was proposed by W. Kahan some 
time ago (see, e.g., [Kahan 1974]), and does not suffer from any of the defects 
mentioned and has the advantage of being literal (given that A is standard 
notation for a difference). Here is a Tj^X macro for it: 

\def \divdif {\mathord\kern.43em\vrule width. 6pt height5.6pt 
depth-. 28pt \kern- .43em\Delta> 

Although divided differences are rightly associated with Newton (because 
of [Newton 1687: Book hi, Lemma v, Case ii], [Newton 1711]; for a detailed 
discussion, including facsimiles of the originals and their translations, see [Fraser 
1918], [Fraser 1919], [Fraser 1927]), the term 'divided difference' was, according 
to [Whittaker et al. 1937: p.20], first used in [de Morgan 1842: p.550], even 
though, by then, Ampere [Ampere 1826] had called it fonction interpolaire, and 
this is the term used in the French literature of the 1800s. To be sure, in 
[Newton 1711], Newton repeatedly uses the term "differentia sic divisa'. 

References 

Ampere, Andre-Marie [1826] Essai sur un noveau mode d'exposition des princi- 
pes du calcul differentiel, du calcul aux differences et de l'interpolation des 
suites, considerees comme derivant d'un source commune, Annals de mathemat- 
icques pures et appliquees de Gergonne (Paris, Bachelier, in 4°) 16, 329-349. 

de Boor, C. [1972] On calculating with B-splines, J. Approx. Theory 6, 50-62. 

de Boor, C. [2003a] A divided difference expansion of a divided difference, J. 
Approx. Theory 122, 10-12. 

de Boor, C. [2003b] A Leibniz formula for multivariate divided differences, SIAM 
J. Numcr. Anal. 41(3), 856-868. 

de Boor, C. and A. Pinkus [2003] The B-spline recurrence relations of Chakalov 
and of Popoviciu, J. Approx. Theory 124, 115-123. 

Bulirsch, R. and H. Rutishauser [1968] Interpolation und genaherte Quadratur, 
in Mathematische Hilfsmittcl des Ingenieurs, Teil III, S. Sauer & I. Szabo, eds, 
Grundlchrcn vol. 141, Springer- Verlag, Heidelberg, 232-319. 

Cauchy, Augustin [1840] Sur les fonctions interpolaires, C. R. Acad. Sci. Paris 
11, 775-789. 

Tchakaloff, L. [1934] Sur la structure des ensembles lineaires clcfinis par une 
certaine propriete minimale, Acta Math 63, 77-97. 

Chakalov, L. [1938] On a certain presentation of the Newton divided differences 
in interpolation theory and its applications (in Bulgarian) , Annuaire Univ. Sofia, 
Fiz. Mat. Fakultet 34, 353-394. 



68 



C. de Boor 



Curry, H. B. and I. J. Schocnberg [1966] On Polya frequency functions IV: the 
fundamental spline functions and their limits, J. Analyse Math. 17, 71-107. 

Erdos, P. and P. Turan [1940] On interpolation, III. Interpolatory theory of 
polynomials, Appl. Math.(2) 41, 510-553. 

Favard, J. [1940] Sur Interpolation, J. Math. Pures Appl. 19, 281-306. 

Floater, M. [2003] Error formulas for divided difference expansions and numer- 
ical differentiation, J. Approx. Theory 122, 1-9. 

Fraser, Duncan C. [1918] Newton's interpolation formulas, J. Instit. Actuaries 
LI, 77-106. 

Fraser, Duncan C. [1919] Newton's interpolation formulas. Further Notes, J. 
Instit. Actuaries LI, 211-232. 

Fraser, Duncan C. [1927] Newton's interpolation formulas. An unpublished 
manuscript of Sir Isaac Newton, J. Instit. Actuaries LVIII, 53-95. 

Frobenius, G. [1871] Uber die Entwicklung analytischcr Functionen in Reihen, 
die nach gegebenen Functionen fortschrcitcn, J. rcinc angew. Math. 73, 1-30. 

Genocchi, A. [1869] Relation entre la difference ct la derivee d'un meme ordre 
quelconque, Archiv Math. Phys. (I) 49, 342-345. 

Genocchi, A. [1878a] Intorno alle funzioni interpolari, Atti della Reale Ac- 
cademia delle Scienze di Torino 13, 716-729. 

Genocchi, A. [1878b] Sur la formule sommatoire de Maclaurin et les fonctions 
interpolaires, C. R. Acad. Sci. Paris 86, 466-469. 

Hermite, Ch. [1859] Sur l'interpolation, C. R. Acad. Sci. Paris 48, 62-67. 

Hermite, Ch. [1878] Formule d'interpolation de Lagrange, J. reine angew. Math. 
84, 70-79. 

Hopf, Eberhard [1926] "Uber die Zusammenhange zwischen gewissen hoheren 
Diffcrenzenquotienten reeller Funktionen einer reellen Variablen und deren Dif- 
ferenzierbarkeitseigenschaften" , dissertation, Univcrsitat Berlin (30 pp). 

Jensen, J. L. W. V. [1894] Sure une expression simple du reste dans la formule 
d'interpolation de Newton, Bull. Acad. Roy. Dancmark xx, 246-xxx. 

Kahan, W. [1974] Divided differences of algebraic functions, Class notes for 
Math 228A, UC Berkeley, Fall. 

Kowalewski, Arnold [1917] "Newton, Cotes, Gauss, Jacobi: Vier grundlegende 
Abhandlungen uber Interpolation und genaherte Quadratur" , Teubner, Leipzig. 

Kowalewski, G. [1932] "Interpolation und genaherte Quadratur" , B. G. Teubner, 
Berlin. 

de Morgan, A. [1842] "The Differential and Integral Calculus", Baldwin and 
Cradock, London, UK. 

Newton, I. [1676] Epistola posterior (Second letter to Leibniz via Oldenburg), 
24 oct. 



Divided Differences 



69 



Newton, I. [1687] "Philosophiae naturalis principia mathematica" , Joseph Strea- 
ter, London, (see 

ftp : //ftp .math. technion. ac . il/hat/f papers/newtonl .pdf 
ftp : //ftp .math. technion. ac . il/hat/f papers /newt on2 .pdf 

Newton, I. [1711] "Methodus differentialis" , Jones, London. 

Norhmd, N. E. [1924] "Vorlesungen uber DifFerenzenrechnung" , Grundlehren 
Vol. XIII, Springer, Berlin. 

Opitz, G. [1964] Steigungsmatrizen, Z. Angew. Math. Mech. 44, T52-T54. 

Popoviciu, Tibere [1933] "Sur quelques proprietes des fonctions d'une ou de 
deux variables reelles", dissertation, presented to the Faculte des Sciences de 
Paris, published by Institutul de Arte Grafice "Ardealul" (Cluj, Romania). 

Popoviciu, Tiberiu [1940] Introduction a la theorie des differences divisees, Bull. 
Mathem., Societea Romana de Stiinte, Bukharest 42, 65-78. 

Schwarz, H. A. [1881-2] Demonstration elementaire d'une propriete fondamen- 
tale des fonctions interpolates, Atti Accad. Sci. Torino 17, 740-742. 

Steffensen, J. F. [1927] "Interpolation" , Williams and Wilkins, Baltimore. 

Steffensen, J. F. [1939] Note on divided differences, Danske Vid. Selsk. Math- 
Fys. Medd. 17(3), 1-12. 

Taylor, Brook [1715] "Methodus incrementorum directa et inversa" , London, 
England. 

Whittaker, E. T. and G. Robinson [1937] "The Calculus of Observations, A 
Treatise on Numerical Mathematics, 2nd ed." , Blackie, Glasgow, UK. 

C. de Boor 
P.O. Box 1076 
Eastsound WA 98245 USA 
deboorQcs . wise . edu 



and 



ittp : //www. cs . wise . edu/ 



~deboor/ 



